Health informatics sits at the vibrant intersection of medicine, data science, and technology, transforming how we store, analyze, and utilize health information. This rapidly evolving field empowers clinicians and researchers to uncover patterns in patient data, improve diagnostic accuracy, and personalize treatment plans without getting lost in complex databases. By turning raw medical records into actionable insights, these innovations are reshaping the future of healthcare delivery and population health management.

At Gist.Science, we bridge the gap between cutting-edge research and public understanding by curating the latest preprints from medRxiv specifically within this domain. Our team processes every new submission in this category, providing both accessible plain-language explanations and detailed technical summaries to ensure the science is clear for everyone, from policymakers to curious readers. Below are the latest papers in health informatics, freshly distilled and ready for you to explore.

DR. INFO at the Point of Care: A Prospective Pilot Study of an Agentic AI Clinical Assistant

This prospective pilot study demonstrates that DR. INFO, an agentic AI clinical assistant, significantly improves time efficiency and clinical decision support for physicians in Portuguese healthcare settings, achieving high satisfaction scores and a Net Promoter Score of 81.2 without evidence of attrition bias.

Corga Da Silva, R., Romano, M., Mendes, T., Isidoro, M., Ravichandran, S., Kumar, S., van der Heijden, M., Fail, O., Gnanapragasam, V. E.2026-04-01📄 health informatics

MedScope: A Lightweight Benchmark of Open-Source Large Language Models for Medical Question Answering

This paper introduces MedScope, a lightweight benchmarking framework that systematically evaluates six open-source large language models on medical multiple-choice questions using multi-dimensional metrics and visual analyses, revealing significant performance heterogeneity and highlighting their current unsuitability for unsupervised high-risk clinical deployment despite their value as transparent baselines.

Bian, R., Cheng, W.2026-04-01📄 health informatics

Combining Token Classification With Large Language Model Revision for Age-Friendly 4M Entity Recognition From Nursing Home Text Messages: Development and Evaluation Study

This study presents and evaluates a multi-stage pipeline that combines a fine-tuned Bio-ClinicalBERT token classifier with locally deployed open-source large language models for revision, demonstrating that this hybrid approach significantly improves the accuracy and efficiency of extracting structured Age-Friendly 4M (What Matters, Medication, Mentation, and Mobility) information from informal nursing home text messages compared to single-stage models.

Amewudah, P., Popescu, M., Farmer, M. S., Powell, K. R.2026-04-01📄 health informatics

MedResearchBench: A Multi-Domain Benchmark for Evaluating AI Research Agents on Clinical Medical Research

MedResearchBench introduces the first multi-domain benchmark specifically designed to evaluate AI research agents on clinical medical tasks by leveraging public datasets and high-quality ground truth to assess performance across seven clinical domains using six medical-specific dimensions, thereby addressing the critical gap in evaluating AI's ability to conduct publication-quality, clinically sound research.

Tan, S., Tian, Z.2026-03-31📄 health informatics

VaaS is a Multi-Layer Hallucination Reduction Pipeline for AI-Assisted Science: Production Validation and Prospective Benchmarking

This paper introduces and validates VaaS, a multi-layer, cost-effective pipeline that reduces AI hallucinations in scientific citation generation to near-zero levels through iterative refinement and rigorous benchmarking, thereby enabling reliable AI-assisted biomedical research at production scale.

Sabharwal, A., Patel, M. S., Carrano, A., Rotman, M., Wierson, W., Ekker, S. C.2026-03-30📄 health informatics

Availability and Quality of Anthropometric Data in Swiss Childrens Hospitals: The SwissPedGrowth Project

The SwissPedGrowth project demonstrates the feasibility of extracting high-quality anthropometric data from heterogeneous Swiss children's hospital electronic health records for growth research, despite challenges regarding data completeness and the need for weighting to ensure population representativeness.

Leuenberger, L. M., Shoman, Y., Romero, F., Deligianni, X., Hartung, A., Mozun, R., Goebel, N., Bielicki, J. A., Burckhardt, M.-A., Latzin, P., Saner, C., Posfay-Barbe, K. M., Schwitzgebel, V., Gianno (…)2026-03-30📄 health informatics

MOE-ECG: Multi-Objective Ensemble Fusion for Robust Atrial Fibrillation Detection Using Electrocardiograms

This paper presents MOE-ECG, a multi-objective ensemble fusion framework that utilizes particle swarm optimization and Dempster-Shafer theory to simultaneously optimize predictive performance and model diversity, achieving robust and accurate atrial fibrillation detection across multiple ECG datasets.

Peimankar, A., Hossein Motlagh, N., K. Khare, S., Spicher, N., Dominguez, H., Abolghasemi, V., Fujiwara, K., Teichmann, D., Rahmani, R., Puthusserypady, S.2026-03-30📄 health informatics

HealthFormer: Dual-level time-aware Transformers for irregular electronic health record events

The paper proposes HealthFormer, a dual-level, time-aware Transformer framework pretrained on large-scale longitudinal EHRs using multi-task self-supervision to learn hierarchical, event-centric patient representations that achieve state-of-the-art performance in incident cancer prediction through straightforward fine-tuning.

Körösi-Szabo, P., Kovacs, G., Csiszarik, A., Forrai, B., Laki, J., Szocska, M., Kovats, T.2026-03-27📄 health informatics

Federated Learning Performance Depends on Site Variation in Global HIV Data Consortia

This study demonstrates that Federated Learning effectively enables privacy-preserving, multi-site machine learning for HIV care across diverse international cohorts, achieving performance comparable to centralized models while significantly outperforming local site-specific approaches.

Jackson, N. J., Yan, C., Caro-Vega, Y., Paredes, F., Ismerio Moreira, R., Cadet, S., Varela, D., Cesar, C., Duda, S. N., Shepherd, B. E., Malin, B. A.2026-03-27📄 health informatics